A comparative Study of Outlier Mining and Class Outlier Mining

نویسندگان

  • Motaz K. Saad
  • Nabil M. Hewahi
چکیده

Outliers can significantly affect data mining performance. Outlier mining is an important issue in knowledge discovery and data mining and has attracted increasing interests in recent years. Class outlier is promising research direction. Few researches have been done in this direction. The paper theme has two main goals: the first one is to show the significance of Class Outlier Mining by discussing a comparative study between a Class Outlier detection method called Class Outlier Distance Based (CODB) and a conventional Outlier detection method. The second goal is to introduce Enhanced Class Outlier Distance Based (ECODB) algorithm which is enhancement of CODB algorithm. ECODB reduces CODB parameters using a heuristic approach. The experimental results show that CODB can detect Class Outliers that cannot be detected using conventional Outlier detection methods. The experiments also show that ECODB works efficiently as CODB.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparative Analysis of Outlier Detection Techniques

Data Mining simply refers to the extraction of very interesting patterns of the data from the massive data sets. Outlier detection is one of the important aspects of data mining which actually finds out the observations that are deviating from the common expected behavior. Outlier detection and analysis is sometimes known as outlier mining. In this paper, we have tried to provide the broad and ...

متن کامل

Local multivariate outliers as geochemical anomaly halos indicators, a case study: Hamich area, Southern Khorasan, Iran

Anomaly recognition has always been a prominent subject in preliminary geochemical explorations. Among the regional geochemical data processing, there are a range of statistical and data mining techniques as well as different mapping methods, which serve as presentations of the outputs. The outlier’s values are of interest in the investigations where data are gathered under controlled condition...

متن کامل

A Comparative Study of RNN for Outlier Detection in Data Mining

We have proposed replicator neural networks (RNNs) for outlier detection [8]. Here we compare RNN for outlier detection with three other methods using both publicly available statistical datasets (generally small) and data mining datasets (generally much larger and generally real data). The smaller datasets provide insights into the relative strengths and weaknesses of RNNs. The larger datasets...

متن کامل

Applying Artificial Immune System for Outlier Detection: A Comparative Study

Outlier detection is a data mining method for discovering exceptional, abnormal or suspiciously unusual samples in a data set. Outliers typically represent the data rich but information poor dilemma. Data mining methods are applied to solve this problem in broad range of application fields like credit card fraud detection, network intrusion detection, error extraction, clinical disease research...

متن کامل

A Framework for Outlier Detection in Geographic Spatial Data

Outlier detection is very interesting, useful and challenging problem in the field of data mining. Because of sparse data clustering algorithm which are based on distance will not work to find outliers in spatial data. Problem of finding irregular feature in spatial data need to be explore. Many existing approaches have been proposed to overcome the problem of outlier detection in spatial Geogr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009